Online learning with Erdos-Renyi side-observation graphs

نویسندگان

Tomás Kocák

Gergely Neu

Michal Valko

چکیده

We consider adversarial multi-armed bandit problems where the learner is allowed to observe losses of a number of arms beside the arm that it actually chose. We study the case where all non-chosen arms reveal their loss with an unknown probability rt, independently of each other and the action of the learner. Moreover, we allow rt to change in every round t, which rules out the possibility of estimating rt by a well-concentrated sample average. We propose an algorithm which operates under the assumption that rt is large enough to warrant at least one side observation with high probability. We show that after T rounds in a bandit problem with N arms, the expected regret of our algorithm is of order O (√∑T t=1(1/rt) logN ) , given that rt ≥ log T/(2N − 2) for all t. All our bounds are within logarithmic factors of the best achievable performance of any algorithm that is even allowed to know exact values of rt.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Effective Comparison of Graph Clustering Algorithms via Random Graphs

Many graph clustering algorithms have been proposed in recent past researches, each algorithm having its own advantages and drawbacks. All these algorithms rely on a very different approach so it’s really hard to say that which one is the most efficient and optimal if we talk in the sense of performance. It is really hard to decide that which algorithm is beneficial in case of highly complex ne...

متن کامل

Exact learning curves for Gaussian process regression on community random graphs

We study learning curves for Gaussian process regression which characterise performance in terms of the Bayes error averaged over datasets of a given size. Whilst learning curves are in general very difficult to calculate we show that for discrete input domains, where similarity between input points is characterized in terms nodes on a graph, accurate predictions can be obtained. These should i...

متن کامل

Strongly balanced graphs and random graphs

The concept of strongly balanced graph is introduced. It is shown that there exists a strongly balanced graph with u vertices and e edges if and only if I s u 1 s e s (2"). This result is applied to a classic question of Erdos and Renyi: What is the probability that a random graph on n vertices contains a given graph? A rooted version of this problem is also solved.

متن کامل

CS 598 : Theoretical Machine Learning

If this is was what we had from the start then the task of clustering would be trivial. However, usually, the graphs that must be clustered are not this perfect and contain edges between S1 and S2. These edges can be considered noise in the representation of G. Therefore, a clustering algorithm would be attempting to cluster a noisy representation, G ′ , of the perfect graph G. More explicitly,...

متن کامل

Exact learning curves for Gaussian process regression on large random graphs

We study learning curves for Gaussian process regression which characterise performance in terms of the Bayes error averaged over datasets of a given size. Whilst learning curves are in general very difficult to calculate we show that for discrete input domains, where similarity between input points is characterised in terms of a graph, accurate predictions can be obtained. These should in fact...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Online learning with Erdos-Renyi side-observation graphs

نویسندگان

چکیده

منابع مشابه

An Effective Comparison of Graph Clustering Algorithms via Random Graphs

Exact learning curves for Gaussian process regression on community random graphs

Strongly balanced graphs and random graphs

CS 598 : Theoretical Machine Learning

Exact learning curves for Gaussian process regression on large random graphs

عنوان ژورنال:

اشتراک گذاری